Generalised Discount Functions applied to a Monte-Carlo AI u Implementation
نویسندگان
چکیده
In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are few examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform AIXIjs the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on an agent’s policy. Using this, we investigate how geometric, hyperbolic and power discounting affect an informed agent in a simple MDP. We experimentally reproduce a number of theoretical results, and discuss some related subtleties. It was found that the agent’s behaviour followed what is expected theoretically, assuming appropriate parameters were chosen for the Monte-Carlo Tree Search (MCTS) planning algorithm. Keywords— Reinforcement Learning, Discount Function, Time Consistency, Monte Carlo
منابع مشابه
Generalised Discount Functions
In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are no examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform (AIXIjs) the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on ...
متن کاملUncertainties due to Fuel Heating Value and Burner Efficiency on Performance Functions of Turbofan Engines Using Monte Carlo Simulation
In this paper, the impacts of the uncertainty of fuel heating value as well as the burner efficiency on performance functions of a turbofan engine are studied. The mean value and variance curves for thrust, thrust specific fuel consumption as well as propulsive, thermal and overall efficiencies are drawn and analyzed, considering the aforementioned uncertainties based on various Mach numbers at...
متن کاملImplementing The Generalised Hybrid Monte-Carlo Algorithm
UKQCD’s dynamical fermion project uses the Generalised Hybrid Monte Carlo (GHMC) algorithm to generate QCD gauge configurations for a non-perturbatively O(a) improved Wilson action with two degenerate sea-quark flavours. We describe our implementation of the algorithm on the Cray-T3E, concentrating on issues arising from code verification and performance optimisation, such as parameter tuning, ...
متن کاملDesign and Simulation of Photoneutron Source by MCNPX Monte Carlo Code for Boron Neutron Capture Therapy
Introduction Electron linear accelerator (LINAC) can be used for neutron production in Boron Neutron Capture Therapy (BNCT). BNCT is an external radiotherapeutic method for the treatment of some cancers. In this study, Varian 2300 C/D LINAC was simulated as an electron accelerator-based photoneutron source to provide a suitable neutron flux for BNCT. Materials and Methods Photoneutron sources w...
متن کاملSecondary Particles Produced by Hadron Therapy
Introduction Use of hadron therapy as an advanced radiotherapy technique is increasing. In this method, secondary particles are produced through primary beam interactions with the beam-transport system and the patient’s body. In this study, Monte Carlo simulations were employed to determine the dose of produced secondary particles, particularly neutrons during treatment. Materials and Methods I...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017